Mining coherence in time series data
نویسندگان
چکیده
This paper presents work on modeling coherence between time series data. Work is based on the elaboration of formula that computes the distance between time series. Based on the computed distances, the method exploits the closest neighbor algorithm and leads to the construction of a phylogeny-clustering tree. Using car sales data (available on the www) we demonstrate work done and present preliminary results. We discuss implications of the work performed in correlating time series data with documents including such data. Introduction Time series occur in many aspects of economic and social activity. Time series organization of data implies time stamping of individual observations. Observations may represent economic or social activity or even life critical measurements such as those acquired when a patient is under monitoring in a critical care unit. Time series data modeling has been an active area of research in statistics. A variety of models exist, which manifest interest and provide the interested analyst with analysis tools (Box and Jenkins 1976). Expanding interest in data mining and knowledge discovery has contributed to an increase of research awareness in learning using time series data (Morik 2000). This paper reports preliminary research results and modeling activity in integrating time series data with documents. Motivation originates from the fact that analysts do use time series data to prepare reports. Reports and time series data reside in distributed information archives. There is an emerging need to mine for knowledge over such archives, which contain both reports and data. Specifically, the paper presents a methodology, which investigates similarity between time series data and reports results using exemplar time series drawn from car sales (see legend in Figure 1, in the text). Data-mining methodology is realized by the introduction of a novel algorithmic process and related formulas, for discovering similar and indicative patterns in time-series collections. Final outcome is the clustering of time-series into similar-groups, visualized by the appropriate customization of a phylogeny-based clustering algorithm and tool.
منابع مشابه
Evaluation of Sentinel-1 Interferometric SAR Coherence efficiency for Land Cover Mapping
In this study, the capabilities of Interferometric Synthetic Aperture Radar (InSAR) time series data and machine learning have been evaluated for land cover mapping in Iran. In this way, a time series of Sentinel-1 SAR data (including 16 SLC images with approximately 24 days time interval) from 2018 to 2020 were used for a region of Ahvaz County located in Khuzestan province. Using InSAR proces...
متن کاملForecasting Gold Price using Data Mining Techniques by Considering New Factors
Gold price forecast is of great importance. Many models were presented by researchers to forecast gold price. It seems that although different models could forecast gold price under different conditions, the new factors affecting gold price forecast have a significant importance and effect on the increase of forecast accuracy. In this paper, different factors were studied in comparison to the p...
متن کاملFuzzy clustering of time series data: A particle swarm optimization approach
With rapid development in information gathering technologies and access to large amounts of data, we always require methods for data analyzing and extracting useful information from large raw dataset and data mining is an important method for solving this problem. Clustering analysis as the most commonly used function of data mining, has attracted many researchers in computer science. Because o...
متن کاملThe Effect of Rainfall Parameters on Runoff Using Wavelet Coherence Measure
In this research wavelet coherence measure is implemented for evaluating the relations and effect of rainfall parameters over many years on runoff fluctuations that is for testing proposed linkages between two time series. In this way, monthly Hydro climatological as 3 rainfall stations, one runoff in the outlet of Ardabil plain were used. The results illustrate that 8-12 and 8-16 month modes o...
متن کاملModeling and prediction of time-series of monthly copper prices
One of the main tasks to analyze and design a mining system is predicting the behavior exhibited by prices in the future. In this paper, the applications of different prediction methods are evaluated in econometrics and financial management fields, such as ARIMA, TGARCH, and stochastic differential equations, for the time-series of monthly copper prices. Moreover, the performance of these metho...
متن کامل